Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio (right / left) |
---|---|---|---|---|
En | 2769 | 160 | 1 | 160.0000 |
Les | 6574 | 540 | 5 | 108.0000 |
Des | 947 | 70 | 1 | 70.0000 |
Mais | 2133 | 67 | 1 | 67.0000 |
Et | 1437 | 60 | 1 | 60.0000 |
La | 6853 | 533 | 10 | 53.3000 |
Cette | 1276 | 103 | 2 | 51.5000 |
Un | 1824 | 147 | 3 | 49.0000 |
Ce | 1814 | 93 | 2 | 46.5000 |
Le | 10177 | 636 | 15 | 42.4000 |
Il | 4855 | 136 | 4 | 34.0000 |
Au | 1001 | 68 | 2 | 34.0000 |
Une | 1691 | 134 | 4 | 33.5000 |
Depuis | 479 | 31 | 1 | 31.0000 |
Dans | 1399 | 27 | 1 | 27.0000 |
Pour | 1977 | 69 | 3 | 23.0000 |
Ils | 946 | 46 | 2 | 23.0000 |
On | 1442 | 68 | 3 | 22.6667 |
Je | 1531 | 63 | 3 | 21.0000 |
Avec | 518 | 21 | 1 | 21.0000 |
Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio (right / left) |
---|---|---|---|---|
face | 1174 | 3 | 52 | 0.0577 |
moteur | 125 | 1 | 11 | 0.0909 |
2006 | 1609 | 15 | 138 | 0.1087 |
type | 204 | 1 | 9 | 0.1111 |
solide | 75 | 1 | 8 | 0.1250 |
vague | 80 | 1 | 8 | 0.1250 |
structure | 108 | 1 | 7 | 0.1429 |
décide | 75 | 1 | 7 | 0.1429 |
Ségolène | 144 | 1 | 7 | 0.1429 |
prêt | 171 | 2 | 13 | 0.1538 |
millions | 1726 | 21 | 127 | 0.1654 |
acte | 120 | 2 | 12 | 0.1667 |
gagnant | 66 | 1 | 6 | 0.1667 |
langues | 48 | 1 | 6 | 0.1667 |
cessé | 47 | 1 | 6 | 0.1667 |
types | 42 | 1 | 6 | 0.1667 |
prestations | 75 | 1 | 6 | 0.1667 |
Olympiques | 37 | 1 | 6 | 0.1667 |
cinquantaine | 67 | 1 | 6 | 0.1667 |
partenaire | 91 | 1 | 6 | 0.1667 |
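In this subsection, we compute for each word the ratio of its number of right neighbors to its number of left neighbors. Again, we look for words with extreme ratios. The first table above lists the 20 words with the highest ratios; the second lists words with very low ratios, in ascending order.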
Data for first table:
select word, w.freq, aa.cnt, bb.cnt, aa.cnt/bb.cnt as r from words w, (select w1_id, count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id, count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
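As a rough illustration of the first query, the following Python sketch runs an equivalent statement against a small in-memory SQLite database. The table and column names (`words`, `co_n`, `w_id`, `w1_id`, `w2_id`, `freq`) are taken from the queries above. The toy rows, the function name `neighbor_ratios`, and the `min_id` parameter are invented for this example, and the sketch assumes that each `co_n` row records `w2_id` occurring directly to the right of `w1_id`. Because SQLite performs integer division on integer operands, the ratio is multiplied by 1.0 here; the original query may not need this on its own database system.

```python
import sqlite3

# Hypothetical toy data: (w_id, word, freq). IDs start above 100 because
# the queries in the text only consider w1_id > 100 and w2_id > 100.
TOY_WORDS = [
    (101, "En", 5),
    (102, "face", 4),
    (103, "moteur", 2),
    (104, "2006", 3),
]

# Hypothetical neighbor pairs (w1_id, w2_id), assumed to mean
# "w2_id occurs directly to the right of w1_id".
TOY_CO_N = [
    (101, 102), (101, 103), (101, 104),  # En has three right neighbors
    (104, 101),                          # En has one left neighbor
    (103, 102), (104, 102),              # further left neighbors of face
    (102, 103),                          # face has one right neighbor
]


def neighbor_ratios(words, co_n, min_id=100):
    """Return (word, freq, n_right, n_left, ratio) rows, highest ratio first."""
    con = sqlite3.connect(":memory:")
    con.execute(
        "create table words (w_id integer primary key, word text, freq integer)"
    )
    con.execute("create table co_n (w1_id integer, w2_id integer)")
    con.executemany("insert into words values (?, ?, ?)", words)
    con.executemany("insert into co_n values (?, ?)", co_n)
    rows = con.execute(
        """
        select w.word, w.freq, aa.cnt, bb.cnt,
               1.0 * aa.cnt / bb.cnt as r   -- 1.0 forces real division in SQLite
        from words w
        join (select w1_id, count(w2_id) as cnt   -- right neighbors per word
              from co_n where w1_id > ? group by w1_id) aa
          on w.w_id = aa.w1_id
        join (select w2_id, count(w1_id) as cnt   -- left neighbors per word
              from co_n where w2_id > ? group by w2_id) bb
          on aa.w1_id = bb.w2_id
        order by r desc
        """,
        (min_id, min_id),
    ).fetchall()
    con.close()
    return rows


if __name__ == "__main__":
    for row in neighbor_ratios(TOY_WORDS, TOY_CO_N):
        print(row)
```

On this toy data, "En" has the highest ratio (three right neighbors, one left neighbor) and "face" the lowest (one right neighbor, three left neighbors). Reversing the sort order to `order by r asc` would list the low-ratio words first. Note that, as in the original query, words with no right neighbors or no left neighbors drop out of the join entirely.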
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.2 Number of NN co-occurrences vs. Frequency II